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SYSTEM AND METHOD FOR CAPTURE AND UTILIZATION OF 
CONTENT AND SOURCE INFORMATION 

FIELD OF THE INVENTION 

The present disclosure relates to a system and method for capture and 
utilization of content and source information. More particularly, the disclosure relates 
to a system and method with which content contained within a hardcopy source, such 
as a book or magazine, can be captured and integrated with a user application, such as 
a word processing application, along with information about the source of the 
captured content. 

BACKGROUND OF THE INVENTION 

Researchers often collect information (i.e., content) from various different 
sources when conducting research on a particular topic. Once this information is 
collected, the researcher often then creates a document describing his or her findings. 
For instance, students often consuft various library books when conducting research 
for assignments in which the student is required to write a paper on a particular topic. 

Where the researcher relies upon information located in one or more sources in 
creating the research document, the researcher normally provides acknowledgements 
of the sources for the located information reflected in the document. For example. 
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where general concepts found in a particular source are reflected in the document, the 
researcher may provide a bibHography identifying the source as providing the basis for 
the document. Where a quote is provided from the source or a particular piece of 
information (e.g., a fact or idea) expressed in the source is reproduced, the researcher 

5 may flirther provide a footnote or endnote that identifies the origin of the quote or 
other piece of information. 

When the research is conducted, the researcher typically records information 
located during the research on which the researcher may later rely in creating a 
document. For instance, the researcher may take handwritten notes of the content 

10 expressed in the source or may photocopy one or more pages of the source for future 
reference. In the former case, recordation of the information can be tedious, 
particularly where large amounts of content are being taken fi-om the source, hi the 
latter case, the researcher must normally tab the various passages of the sources, carry 
them to a copy machine, make the photocopies (typically in exchange for a fee), and 

1 5 then return the sources. 

In addition to recording the content found in the sources, the researcher must 
normally further record bibliographic information concerning the sources so that, if 
this content is later used in creating the document, proper acknowledgement of the 
sources can be provided. Recordation of this information can also be tedious for the 

20 researcher whether the researcher is taking handwritten notes or using a photocopier. 
In the former case, the researcher must vmte down the bibliographic information of 
the source, including an identification of particular pages where particular quotes or 
other specific pieces of information were found, hi recording this information, the 
researcher must be careful to collect all relevant bibliography information so that 
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proper acknowledgement can later be provided. In the photocopying context, the 
researcher must photocopy the bibliographic information {e.g., contained on the title 
page of the source) and keep accurate records as to which photocopied bibliographic 
information pertains to which photocopied content. 

5 Once the researcher has completed his or her research and is prepared to create 

a document describing his or her findings, the researcher must organize the content 
and attendant bibliographic information and then insert this information in the correct 
format within the document. Li that this information is not in electronic form, this 
task involves manually reproducing the information in the document, for instance, by 

10 transcription using a keyboard. This manual transcription provides an opportunity for 
various mistakes to be made. For instance, where a quote from a particular source is 
to be used, it is possible for the researcher to incorrectly transcribe the quoted 
material. The researcher may also mis-transcribe the bibhographic information that 
pertains to the sources. Furthermore, where the user failed to record all such 

15 bibliographic information, the researcher may need to return to the source and 
rerecord this information. 

In view of the foregoing, it can be appreciated that it would be desirable to have 
a system and method that, at least partially, automates the process of collecting 
content and source information that identifies the origin of the content. Furthermore, 

20 it would be desirable to have a system and method that, at least partially, automates 
integration of the content and source information into a document. 
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SUMMARY OF THE INVENTION 



The present disclosure provides a system and method for capture and utilization 
of content source information. 

5 In one arrangement, the method comprises the steps of electronically capturing 

content, electronically capturing source information pertinent to the source of the 
captured content, associating the content and the source information; and transmitting 
the content and source information to a device for manipulation. 

By way of example, this method can be practiced with a handheld scanning 

10 device. Such a device can comprise a housing configured as a pen, a scan head that is 
adapted to capture information from a source, and memory including an information 
association module that is configured to associate captured content with captured 
source information. 

In one arrangement, the method comprises the steps of receiving content and 
15 associated source information pertinent to the source of the content in electronic form, 
reconfiguring the content and associated source information for use in a user 
application, and automatically creating at least one source acknowledgement in the 
user application. 

Other methods, systems, features, and advantages of the invention will become 
20 apparent upon reading the following specification, when taken in conjunction with the 
accompanying drawings. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The invention can be better understood with reference to the following drawings. 
The components in the drawings are not necessarily to scale, emphasis instead being 
placed upon clearly illustrating the principles of the invention. 

FIG. 1 is a schematic view of the inventive concept. 

FIG. 2 is a schematic view of a system for capture and utilization of content and 
source information. 

FIG. 3 is a schematic view of a scanning device shown in FIG. 2. 

FIG. 4 is a schematic view of memory of the scanning device shown in FIG. 3. 

FIG. 5 is a schematic view of a computing device shown in FIG. 2. 

FIGS. 6A and 6B provide a flow diagram that illustrates an example of operation 
of the scanning device shown in FIG. 3. 

FIG. 7 provides a flow diagram that illustrates an example of operation of an 
information manager of the computing device shown in FIG. 5. 

DETAILED DESCRIPTION 

As noted above, there are several disadvantages associated with conventional 
research and document preparation in the research context. Accordingly, the present 
disclosure describes a system and method that, at least partially, automates the process 
of collecting content and source information as well as integrating the information 
with a user application such as a word processing application. The general concept 
100 of the invention is illustrated in FIG. 1. As shown in this figure, while conducting 
his or her research, the researcher discovers various information 102 that the 
researcher may wish to use in creating a document that describes his or her findings. 
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This information can include content 104, source information 106 associated with the 
discovered content, and content location information 108 that identifies where {e.g., 
on what page) the content was found. The content 104 can comprise any information 
that the researcher may wish to use in creating the document. For instance, the 

5 content 104 can comprise text that the researcher will use to write the document 
and/or that will be inserted into the document, for example as a quotation. The source 
information 106 typically comprises bibliographic information used to acknowledge 
the source in the research document. Accordingly, this information 106 typically 
comprises a title of the source {e.g., book or periodical), author, pubhsher, publication 

10 date, etc. 

Once the content 104, source information 106, and content location information 
108 is located by the researcher, it is recorded by the researcher so that he or she can 
later refer to this information and use it to prepare the research document. Due to the 
disadvantages associated with conventional methods of recordation of this type of 

15 information, the information 100 is electronically captured according to the present 
system and method. This is preferably accomphshed by scanning the information 100. 
As for the content 104, various passages contained within the source can be scanned 
with an appropriate scanning device and stored within the scanning device. As for the 
source information 106, this information can be captured by scanning data code 

20 information, such as bar code information, provided on the source. For instance, where 
the source comprises a book, a bar code containing all bibUographic information can be 
provided on the binder of the book or title page. Alternatively, the bar code information 
can comprise look-up information with which the bibliographic information can later be 
retrieved for integration into the research document. Finally, the content location 



information 108 pertaining to each piece of located content 104 can be captured with the 
scanning device. For instance, the page numbers that identify the pages on which the 
located content 104 was found can be captured, if desired. 

Once the various information 100 is captured as described above, various data 
5 processing can be performed on or with the information, as indicated in block 1 10. This 
processing can comprise character recognition 112, information association 114, 
information retrieval 116, and data reconfiguration 118. By way of example, character 
recognition 108 can comprise optical character recognition (OCR) of the captured 
content 104 and content location information 108 so that this information can later be 
10 directly imported in the research document, as desired. The information association 112 
can comprise association of the content 104 with the source information 106 and content 
location information 108. The information retrieval 116 can comprise retrieval of 
bibliographic information where the data code information comprises look-up 
information. Finally, data reconfiguration 118 can comprise reconfiguration of the 
1 5 captured and retrieved information for integration with a user application 120. 

Once the various data processing has been performed, the content 104, 
bibhographic information and content location information 108 can be integrated into 
one or more user applications 120. By way of example, the user application 120 can 
comprise a word processing apphcation that the researcher uses to create the research 
20 document. In such a situation, the application 120 may exist and run on a separate 
computing device, for instance a personal computer (PC). Various options are available 
for this integration. For instance, the bibliographic information for one or more of the 
sources can be used to automatically create a bibliography for the document. In another 
example, content 104 can be imported directly into the body of the document in the form 
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of a quotation and an acknowledgement automatically generated within the body of the 
document containing the relevant bibliographic information. Altematively, such an 
acknowledgement for the imported content 104 can be automatically generated in the 
form of a footnote or endnote. 

As can be appreciated from the above description, a system and method that 
operate according to the above-described conceptuaUzation greatly simphfy the research 
and writing process for the researcher and would prevent transcription mistakes that can 
be made by the researcher when conventional research and writing techniques are used. 
The general concept of the invention having been described, an example system for 
capture and utilization of content and source information will be described with 
reference to FIGS. 2-5. Although this system is provided by way of example, it will 
be appreciated that this system is described for purposes of illustration only and that 
various modifications are feasible without departing from the above-described 
inventive concept. 

Referring now FIG. 2, illustrated is a system 200 for capture and utilization of 
content and source information. As indicated in this figure, the system 200 at least 
comprises a scanning device 202. The scanning device 202 typically comprises a self- 
contained, portable scanning device that, for instance, can be carried by the researcher 
(i.e., user) to a location (e.g., library) at which various sources 204 (e.g., books) can be 
found. By way of example, the scanning device 202 can include a housing 203 that is 
similar in configuration to a pen or highlighter with which the user can scan across 
text contained within a source 204 (typically a single line at a time), and scan across 
one or more data codes 206 (e.g., bar codes) of the source to capture source 
information that comprises bibliographic information or look-up information used to 
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retrieve the bibliographic information. As noted above, such a data code 206 can be 
provided on a binder of the source 204 (FIG. 2) or on a title page of the source. 

As is further indicated in FIG. 2, the system 200 can further include a 
computing device 206 with which the research document can be created and, as is 
5 described in greater detail below, the information captured by the scanning device 202 
can be received. As indicated in FIG. 2, this information can be, for example, 
wirelessly transmitted from the scanning device 202 to the computing device 206. It 
will be appreciated, however, that this information can be transmitted through a direct 
electrical or optical connection, if desired. The computing device 206 can take many 
10 different forms. For instance, the computing device 206 can be configxired as a 
desktop PC, laptop PC, or handheld computing device such as a tablet computing 
device, personal digital assistant (PDA), or mobile telephone. As will be appreciated 
by persons having ordinary skill in the art, the particular configuration of the 
computing device 206 is unimportant. The computing device 206 must merely be 
15 capable of receiving the information from the scanning device 202 such that it can be 
integrated into a research document or otherwise manipulated in some manner. 

The computing device 206 can, optionally, be connected to a network 208 so 
that information contained within a remote computing device 210 can be accessed, if 
desired. The network 208 can comprise one or more sub-networks that are 
20 communicatively coupled to each other. By way of example, these networks can 
include one or more local area networks (LANs) and/or wide area networks (WANs). 
Typically, however, the network 208 comprises a set of networks that forms part of the 
Litemet. As indicated in FIG. 2, the remote computing device 210 can comprise a 
network server. Although a network server is described and shown, it is to be 
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appreciated that the server is provided as an example only and that this representation 
is not intended to limit the scope of the present disclosure. 

FIG. 3 is a schematic view illustrating an example configuration for the 
scanning device 202 shown in FIG. 2. Although a particular configuration for the 
scanning device 202 is shown in FIG. 3 and described herein, it will be appreciated 
that ahemative configurations are feasible and may even be preferable, hideed, the 
scanning device 202 can comprise substantially any existing or yet to be produced 
portable scanning device. 

As indicated in FIG. 3, the scanning device 202 generally comprises a 
processing device 300 and scanning hardware 302, one or more input/output devices 
304, memory 306, a power supply 308, and one or more user interface devices 310 
that each are in electrical communication with the processing device. The processing 
device 300 is adapted to control operation of the other components of the scanning 
device 202 and execute commands stored in memory 306. The processing device 300 
can comprise a general-purpose processor, a microprocessor, one or more application- 
specific integrated circuits (ASICs), a plurality of suitably configured digital logic 
gates, and other well known electrical configurations that comprise discrete elements 
both individually and in various combinations to coordinate the overall operation of 
the scanning device 202. 

The scanning hardware 302 comprises the various components used to capture 
and store image information such as content, source information, and content location 
information. As indicated in FIG. 3, the scanning hardware 302 normally comprises 
one or more scan heads 312 and one or more hght sources 314. As will be 
appreciated by persons having ordinary skill in the art, the scanning device 202 can 

10 
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comprise a single scan head 312 that is responsible for scanning {i.e., capturing) both 
text and data code information. In an alternative arrangement, the scanning device 
202 can comprise two separate scan heads 312, one provided at each end of the 
scanning device. In such an arrangement, one of the scan heads is used to capture text 
(for content) and the other scan head is used to read data codes (for source 
information). In any case, the scan heads 312 can comprise a photosensor such as a 
charge coupled device (CCD). 

Where provided, the light sources 314 (e.g., one provided for each scan head 
312) can comprise a florescent, incandescent, or laser light source. Where the 
scanning device is used in environments in which enough ambient Hght exists, the 
hght sources 314 may not be necessary. As is further identified in FIG. 3, the 
scanning hardware 302 typically further comprises an analog-digital (A/D) converter 
316 that is used to convert the varying analog output of the scanning head(s) 312 into 
digital signals that can be manipulated by the scanning device 202 and/or the 
computing device 206 (FIG. 2). 

In addition to the above-noted components, the scanning hardware may 
include other components that are not shown for purposes of simplicity. For instance, 
the scanning hardware 302 may flirther include a focusing mechanism comprising one 
or more lenses, mirrors, etc. Furthermore, the scanning hardware 302 may also 
include a position encoder that provides information as to position and velocity 
information about the scan head(s) 312, and a compensation circuit for reducing 
image distortion which may occur during scanning. 

The input/output devices 304 can include a transceiver 318 and one or more 
communication ports 320 with which data can be transmitted to another device such 

11 
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as the computing device 206. The transceiver 318 can be configured for wireless 
communication that is facilitated through infrared (IR) transmission, radio frequency 
(RF) transmission, microwave transmission, or substantially any other wireless 
communication scheme. In addition or in exception, the communication ports 320 can 
5 be used to directly link the scanning device 202 with the other device through an 
electrical or optical connection such that data can be fransmitted to the other device 
and, optionally, received from the other device. 

The memory 306 can comprise a combination of volatile memory elements 
(e.g., random access memory (RAM, such as DRAM, SRAM, etc.)) and nonvolatile 
10 memory elements (e.g., ROM, hard drive, tape, CDROM, etc.) and is adapted to store 
various software (firmware) that comprises the various commands that, when 
executed by the processing device 300, control operation of the scanning device 202. 
The power supply 308 can comprise a portable power source that is integrated with 
the scanning device 202 such as a battery, or an external power supply that supphes 
1 5 the scanning device with direct current (DC) or alternating current (AC) power. 

The one or more user interface devices 310 typically comprise interface tools 
with which scanning device settings can be changed and through which the user can 
communicate commands to the device 202. By way of example, the user interface 
devices 206 can comprise one or more frmction keys 322 with which the operation of 
20 the scanning device 202 can be controlled and, optionally, a display 324 that is 
adapted to communicate graphical information to the user (e.g., a liquid crystal 
display (LCD)). 

FIG. 4 schematically illustrates an example configuration of scanning device 
memory 306 identified in FIG. 3. To faciUtate the description of the various software 

12 
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(firmware) disposed within memory 306, various software modules (e.g., programs) 
are depicted. Although separate modules are shown and described herein, persons 
having ordinary skill in the art will appreciate that these modules are provided as an 
example only in attempt to explain operation of the scanning device 202. Therefore, 
5 although specific modules are identified, one or more of these modules could be 
combined or divided as is deemed prudent. Furthermore, functions described herein 
as being performed by a particular module could be performed by other modules, as 
the software designer sees fit. 

As indicated in FIG. 4, memory 306 can, for example, comprise an operating 
10 system 400 and an information capture module 402. The operating system 400 
contains the various commands used to control the general operation of the scanning 
device 202 as well as the various software modules of memory 306. The information 
capture module 402 comprises software that is adapted to, in conjunction with the 
scanning hardware 302, capture information that can be, at least temporarily, stored by 
1 5 the scanning device 202 in data storage 408 and/or transmitted to another device (e.g. , 
computing device 206) for further manipulation. 

Further identified in FIG. 4 is an information association module 404 that, as 
is described in greater detail below, is adapted to associate various captured 
information (e.g., content) with other captured information (e.g., source information 
20 and content location information) so that the relationship between the various pieces 
of captured information can be tracked as data is manipulated in some manner (e.g., 
integrated into a user apphcation). Finally, memory 306 can further comprise an 
optical character recognition (OCR) module 406 that is adapted to recognize raw text 
data and reconfigure the data into a format in which it can be imported into another 

13 
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application (e.g., word processing application) as text. Where the captured 
information is to be shared with another device, such as the computing device 206 
provided with its own OCR module, the module 406 need not be provided to free up 
memory for other data and reduce the work for the processing device 300. 

FIG. 5 is a schematic view illustrating an example architecture for the 
computing device 206 that can be used to receive captured information transmitted 
from the scanning device 202 and, if desired, can be used to integrate the information 
into a user apphcation such as a word processing application. As indicated in FIG. 5, 
the computing device 206 can comprise a processing device 500, memory 502, one or 
more user interface devices 504, one or more communication devices 506, and a local 
interface 508 to which each of the other components electrically connects. The local 
interface 508 may have additional elements, which are omitted for simphcity, such as 
controllers, buffers (caches), drivers, repeaters, and receivers to enable 
coramunications. Furthermore, the local interface 508 may include address, control, 
and/or data connections to enable appropriate communications among the 
aforementioned components. 

The processing device 500 can include any custom made or commercially 
available processor, a central processing unit (CPU) or an auxiliary processor among 
several processors associated with the computing device 206, a semiconductor based 
microprocessor (in the form of a microchip), or a macroprocessor. The memory 502, 
like memory 306, can include any one of a combination of volatile memory elements 
(e.g., random access memory (RAM, such as DRAM, SRAM, etc.)) and nonvolatile 
memory elements (e.g., ROM, hard drive, tape, CDROM, etc.). 
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The user interface devices 504 typically comprise those normally used in 
conjunction with a computing device. For instance, the user interface devices 504 can 
comprise a keyboard, mouse, monitor or display screen, etc. The one or more 
communication devices 506 comprise the hardware with which the computing device 
5 206 can interface with the scanning device 202. By way of example, the 
communication devices 506 include a transceiver that is adapted to wirelessly transmit 
data to and wirelessly receive data from the scanning device 202. In addition, the 
communication devices 506 can include a device used to connect with the network 
208, such as a modem. 

10 The memory 502 comprises various software programs including an operating 

system 510, a user apphcation 512, and an information manager 514. The operating 
system 510 controls the execution of other software and provides scheduling, input- 
output control, file and data management, memory management, and communication 
control and related services. The user application 512 can comprise substantially any 

15 application in which the information received from the scanning device 202 can be 
used. For example, the user application 512 can comprise a word processing 
application such as Microsoft Word™ or WordPerfect^". As is discussed in greater 
detail below, the information manager 514 is adapted to receive data transmitted from 
the scanning device 202 and integrate it into the user application 512. 

20 The information manager 514 can include its own OCR module 516, 

information retrieval module 518, and a bibliographic database 520. Where provided, 
the OCR module 516 can receive raw captured data from the scanning device 202 and 
perform a character recognition on the data such that text represented by the data can 
be imported into the user application 512 as text. The information retrieval module 

15 
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518 is adapted to retrieve bibliographic information with reference to source 
information transmitted by the scanning device 202. As described below, this 
bibliographic information can be retrieved from the remote computing device 210 via 
the network 208. Alternatively, this information can be retrieved directly from the 
5 local bibliographic database 520 if this information is contained in the database. 

The remote computing device 210 (FIG. 2) can be configured in similar 
manner to the computing device 206. Accordingly, a detailed description of the 
remote computing device 210 is not provided here. However, it suffices to say that 
the remote computing device 210 typically comprises memory in which a fiorther 
10 database is stored that contains bibliographic information which can be accessed by 
the computing device 206 when needed. 

Various software and/or firmware programs have been described herein. It is to 
be understood that these programs can be stored on any computer readable medium for 
use by or in connection with any computer related system or method. In the context of 
15 this document, a computer readable medium is an electronic, magnetic, optical, or 
other physical device or means that can contain or store a computer program for use 
by or in connection with a computer related system or method. These programs can 
be embodied in any computer-readable medium for use by or in connection with an 
instruction execution system, apparatus, or device, such as a computer-based system, 
20 processor-containing system, or other system that can fetch the instructions from the 
instruction execution system, apparatus, or device and execute the instructions. In the 
context of this document, a "computer-readable medium" can be any means that can 
store, communicate, propagate, or transport the program for use by or in connection 
with the instruction execution system, apparatus, or device. 

16 
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The computer readable medium can be, for example but not limited to, an 
electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, 
apparatus, device, or propagation medium. More specific examples (a nonexhaustive 
list) of the computer-readable medium include an electrical connection having one or 
more wires, a portable computer diskette, a random access memory (RAM), a read- 
only memory (ROM), an erasable programmable read-only memory (EPROM, 
EEPROM, or Flash memory), an optical fiber, and a portable compact disc read-only 
memory (CDROM). Note that the computer-readable medium can even be paper or 
another suitable medium upon which a program is printed, as the program can be 
electronically captured, via for instance optical scanning of the paper or other 
medium, then compiled, interpreted or otherwise processed in a suitable manner if 
necessary, and then stored in a computer memory. 

An example system 200 having been described above, operation of the system 
will now be discussed. In the discussion that follows, flow diagrams are provided. It 
is to be understood that any process descriptions contained in blocks of these flow 
diagrams represent modules, segments, or portions of code which include one or more 
executable instructions for implementing specific logical functions or steps of the 
inventive method, and that ahemative implementations are feasible. Moreover, 
functions may be executed out of order from that shown or discussed, including 
substantially concurrentiy or in reverse order, depending on the functionality involved. 

FIGS. 6 A and 6B illustrate an example of operation of the scanning device 202 
in capturing information located during research that can be used in some manner, for 
instance in creating a document. Beginning with block 600 of FIG. 6A, the scanning 
device 202 is enabled. This enablement can comprise the powering of the device 

17 
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through selection of a power button provided on the device 202 or other function key. 
Once the scanning device 202 is enabled, it can receive a capture source information 
command, as indicated in block 602, entered with one or more of the function keys of 
the user interface devices 310. Receipt of such a command indicates to the scanning 

5 device 202 that source information will be captured with the scan head 312 of the 
device. The source information typically is obtained from data code information that 
is provided on or in the source (e.g., book). The source information may comprise 
bibliographic information for the source such as a title, author, publisher, and 
publication date. Alternatively, the source information can comprise information that 

1 0 identifies where such bibUographic information can be found such that it may later be 
retrieved. 

The scan head 312 scans the source information, as indicated in block 604, 
and, as indicated in block 606, the source information is stored in memory 306. As 
indicated above, the scanning device 202 can comprise a single scan head or separate 
15 text data code scan heads. Where two separate scan heads are provided, the scanning 
is conducted with the data code scan head. At this point, the scanning device 202 is 
then prepared to capture content contained within the source. Therefore, the scan 
head 312 then scans the content, as indicated in block 608. Where two scan heads 312 
are provided, this content capture can be accomplished with the text scan head. As 
20 with the captured source information, the captured content is stored in memory 306, as 
identified in block 610. Once the desired amount (e.g., a particular passage) of 
content has been captured and stored in this manner, content location information, for 
instance one or more page numbers on which the content was found, can further be 
captured and stored, if desired. To accomplish this, the scanning device 202 can 

18 
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switch capture modes, as indicated in block 612, for example in response to a location 
information capture command received from the user. This way, the scanning device 
202 will know that this information is to be used in creating the acknowledgment of 
the source when a document is later created. 

At this point, the scan head scans the content location information, as 
indicated in block 614, and the information is stored, as identified in block 616. 
Where the captured content begins on one page and continues onto another, the 
content location information can comprise a first page number and a second page 
number. In such a situation, the capture process may comprise first capturing the first 
page number, receiving an indication that the first page number has been captured, 
and then capturing the second page number. Alternatively, the scanning device 202 
can be configured to automatically recognize when two separate page numbers have 
been captured. Although the content location information is described as being 
provided as text, it is to be appreciated that other indicia, such as further data code 
information, can be used to identify the location of the content, if desired. 
Furthermore, although the content location information is described as being captured 
after the content is captured, it will be appreciated by persons having ordinary skill in 
the art that these steps could be reversed, if desired. 

Once the source information, content, and content location information have 
been captured in the manner described above, flow continues to block 618 in FIG. 6B 
at which this information is linked such that the captured content is associated with a 
particular source as well as a particular content location (where appHcable). This may 
comprise, for instance, the use of metadata tags applied to the various pieces of 
information. At this point, it can be determined whether OCR is to be performed on 

19 
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content and content location information, as identified in decision element 620. As 
identified above, performance of OCR with the scanning device 202 may not be 
necessary where the computing device 206 has this capability. Where OCR is not to 
be performed, flow can continue down to decision element 624 described below. If 
OCR is to be performed, however, flow continues to block 622 at which the OCR 
module 406 processes the captured content and content location information to 
identify characters. 

Referring now to decision element 624, it can be determined whether more 
content, and its attendant content location information, is to be captured from the 
source. If so, flow returns to block 608 at which the scan head 312 scans the content 
as described above. If no further content is to be captured from the source, however, 
flow continues to decision element 626 at which it is determined whether content is to 
be captured from a different source. If not, flow continues to decision element 628 
described below. If content from another source is to be captured, however, flow 
continues back to block 602 at which a capture source information command is again 
received. 

With reference now to decision element 628, it can be determined whether the 
captured information is to be transmitted to another device such as the computing 
device 206. By way of example, the transmission can occur immediately after the 
research is completed, for instance, where the user has brought a portable computing 
device (e.g., laptop PC) to the research location, or later where the user intends to 
transmit the information to a stationary computing device (e.g., desktop PC). If the 
information is to be transmitted, flow continues to block 630 at which the information 
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is transmitted via the input/output devices 304. If, on the other hand, the information 
is not to be transmitted at this time, the session is over and flow is terminated. 

Reference is now made to FIG. 7, which illustrates an example of operation of 
the information manager 514 of the computing device 206 shown in FIG. 5. As 
5 indicated in block 700, the data transmitted from the scanning device 202 is received. 
At this point, it can be determined whether OCR must be performed on the content 
and content location information that was captured in the research session, as 
indicated in decision element 702. This determination can be made through analysis 
of the received data to determine whether OCR has already been performed and/or 
10 whether the content and content location information comprises raw data. If OCR is 
not required, flow continues to decision element 706 described below. If OCR is 
required, however, it is performed on the raw data, as indicated in block 704. 

With reference to decision element 706, it is determined whether the 
bibliographic information must be retrieved. Such retrieval is necessary where the 
15 source information only comprised an identification of where bibhographic 
information is stored versus the bibhographic information itself. The identification 
can comprise, for instance, information used to look-up the bibliographic information 
in the bibliographic database 520 or an address (e.g., universal resource locator 
(URL)) that identifies where the bibliographic information is stored out on the 
20 network 208. If the source information comprises the bibhographic information, flow 
continues down to block 710 described below. If this information must be retrieved, 
however, flow continues to block 708 at which the bibliographic information is 
retrieved from the local database 520 or from a database of a remote device such as 
the remote computing device 210. 
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With reference to block 710, the captured information (and retrieved 
bibhographic information where applicable) can then be reconfigured for integration 
into a user appUcation such as a word processing appHcation. This reconfiguration 
can take many different forms depending upon the user's wishes. For instance, the 
5 user can be provided with the choice of having the information manager 514 
automatically create a bibUography for a document he or she is creating with the 
captured (and retrieved) information. In another example, the user can have the 
information manager 514 automatically generate footnotes or endnotes as specific 
pieces of content are imported into the document by the user. By way of example, 
10 these choices can be communicated by the user with a separate appHcation itself 
associated with the information manager 514 or can be communicated with the user 
application itself where the application is configured to receive such choices. 

At this point, the content can be integrated into the user appHcation, as 
indicated in block 712. This integration can result from, for instance, selection by the 
15 user of one or more content portions for importation into the document. In such a 
scenario, the information manager 514 can, at the choice of the user, automatically 
generate an appropriate acknowledgement of the source in the form of a notation in 
the bibliography, footnote, and/or endnote. Such importation can continue in this 
manner until all captured content that the user wishes to use has been added to the 
20 document. At this point, flow for the information manager 5 14 is terminated. 

From the above, it can be appreciated that research and document creation can 
be greatly simpHfied with a system for content and source information capture and 
integration such as that described above. While particular embodiments of the 
invention have been disclosed in detail in the foregoing description and drawings for 
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purposes of example, it will be understood by those skilled in the art that variations and 
modifications thereof can be made without departing from the scope of the invention as 
set forth in the following claims. 
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